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ABSTRACT 

Exosomes are membraneous nanovesicles of 
endocytic origin released by most cell types from 
diverse organisms; they play a critical role in 
cell-cell communication. ExoCarta (http://www 
.exocarta.org) is a manually curated database of 
exosomal proteins, RNA and lipids. The database 
catalogs information from both published and 
unpublished exosomal studies. The mode of 
exosomal purification and characterization, the 
biophysical and molecular properties are listed in 
the database aiding biomedical scientists in assess- 
ing the quality of the exosomal preparation and the 
corresponding data obtained. Currently, ExoCarta 
(Version 3.1) contains information on 11261 protein 
entries, 2375 mRNA entries and 764 miRNA entries 
that were obtained from 134 exosomal studies. 
In addition to the data update, as a new feature, 
lipids identified in exosomes are added to 
ExoCarta. We believe that this free web-based com- 
munity resource will aid researchers in identifying 
molecular signatures (proteins/RNA/lipids) that 
are specific to certain tissue/cell type derived 
exosomes and trigger new exosomal studies. 

INTRODUCTION 

Exosomes are 40-100 nm diameter membrane vesicles 
of endocytic origin that are released by most cell types 
under normal and pathological conditions upon fusion 
of multivesicular bodies (MVBs) with the plasma 
membrane (PM) (1,2). Since the original observation of 
exosome secretion from reticulocytes (3,4), the release of 
these membrane particles in vitro into culture medium 
is reported in other cells (2,5-7). Exosomes have a cup 
shaped appearance as visioned by electron microscopy, 



a buoyant density in sucrose of 1.10-1.21 g/ml; and 
sediment at lOOOOOg. Exosomes harbor proteins/RNA/ 
lipids that reflect the functionality of the host cell and 
posses molecular signatures or footprints resembling the 
diseased cell from which they were secreted (7). Recent 
studies have shown that exosomes can be isolated in vivo 
in bodily fluids such as blood (8), urine (9), breast milk 
(10), amniotic fluid (11), malignant ascites (12), 
bronchoalveolar lavage fluid (13) and synovial fluid (14). 
The accessibility of exosomes in bodily fluids makes them 
an ideal specimen for biomarker discovery. 

While several protocols are employed to isolate and 
purify exosomes, characterizing them is strongly based 
on electron microscopy (size and shape) and western 
blotting for the existence of protein markers that are 
involved in exosomal biogenesis such as PDCD6IP 
(Alix), TSG101 and/or exosomal membrane molecules 
such as CD9, CD63 and CD81. As the number of 
exosomal studies expands to include more organisms 
and cell types, new insights are gained on proteins 
common to all exosomes irrespective of the cell-type 
origin as well as host-cell-specific proteins (15). A 
database of proteins, RNA and lipids identified in 
exosomes will reveal classes of molecules that are more 
often found in exosomes and those that are specific to 
certain cell types. Such additional information on host 
cell specific exosomal membrane molecules will trigger 
further immunoaffinity-based studies and aid biomedical 
researchers to isolate disease cell derived exosomes devoid 
of exosomes secreted by other normal cells. Here we 
describe, ExoCarta, a manually curated web-based 
database (http://www.exocarta.org) of the exosomal mo- 
lecular components such as lipids, proteins, mRNA and 
miRNA as reported in the scientific literature (16). More 
importantly, the database contains annotations on the 
exosomal isolation and purification procedures which 
will indicate the users on the reliability of the exosome 
preparation. 
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Figure 1. A snapshot of the protein molecule page of TSG101 in ExoCarta is displayed. Users can access molecule centric information from the 
protein molecule page in ExoCarta. The gene ontology annotations, protein-protein interaction network, graphical display of the network, experi- 
mental description pertaining to the studies that identified TSG101, mass spectrometry-based peptide identifications along with the MS/MS spectra 
and autoradiographs for western blotting are provided wherever possible as supporting information. 



ExoCarta DATABASE STRUCTURE AND 
FUNCTIONALITY 

ExoCarta was previously described 2 years ago (16). 
Briefly, ExoCarta is built with ZOPE, an open source 
Python-based web application server with dynamic 
HTML templates, scripts, a search engine and connections 
for a relational database. Python is used in the three-tier 
system to connect the web interface with a MySQL 
database. ExoCarta can be queried using the gene 
symbol/name or browsed as a set based on the organism, 
content type (proteins or lipids) and the sample material 
(plasma or urine). The molecule page in ExoCarta 
provides the users with molecule centric view on experi- 
ment description of the study that identified the molecule, 
isolation procedures employed, gene ontology-based an- 
notations, protein-protein interactions and a graphical 
display of the interaction network. Meta annotations re- 
garding the experiment description include the sample 
from which exosomes were isolated (e.g. plasma, breast 
cancer cell line), isolation procedures, the identification 
method (e.g. mass spectrometry), PubMed identifiers, in- 
vestigators of the study, the journal in which the article 
was published, the year of the publication. Protein- 
protein interaction data that was obtained from 
BioGRID (17) is used in the molecule page to display 
the protein interaction network graphically as nodes and 
edges (Figure 1). 



ExoCarta DATABASE ADDITIONS AND 
NEW FEATURES 

Data update 

Since the introduction of ExoCarta in April 2009 (16), 
the exosomal studies added to ExoCarta have doubled. 
As shown in Table 1, ExoCarta currently comprises 
11261 protein entries resulting from 4049 unique 
proteins. A total of 4049 mRNA entries and 764 
miRNA entries obtained from 134 independent studies 
are also cataloged in ExoCarta. As homogenous popu- 
lation of exosomes is of paramount importance, the 
isolation procedures used will indicate the users on 
the reliability of the exosome preparation and the corres- 
ponding data obtained. In this regard, every study in 
ExoCarta is annotated carefully for the experimental pro- 
cedures employed to isolate exosomes. Typical isolation 
procedures such as differential centrifugation, size-based 
membrane filtration, rate zonal centrifugation, flow 
cytometry and immunoaffinity pull downs are clearly 
indicated along with the floatation gradient density of 
exosomes as reported in the study. The statistical distribu- 
tion of the isolation protocols used in the studies 
catalogued in ExoCarta is displayed in Figure 2. Totally, 
84% of the studies employed differential centrifugation, 
56% used rate zonal centrifugation, 30% used ultracen- 
trifugation and 9% employed immunoaffinity-based 
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methods. It has to be noted that many of the studies used 
a combination of these procedures. For example, filtration 
(26%), cannot be used alone to isolate exosomes, was used 
in conjunction with other isolation procedures. While 
earlier exosomal studies employed the far simpler 
approaches of differential centrifugation (15,18), it is 
encouraging to see that recent exosomal studies are 
using stringent protocols to isolate exosomes to homogen- 
eity. This is evident by the fact that only 17 of the 
134 studies used differential centrifugation alone as the 
isolation procedure. 

Addition of lipids 

As a new feature of ExoCarta, lipid molecules identified in 
various exosomal studies are also added in the database. 
Recent studies on exploring the function/composition 
of lipids in exosomes have resulted in handful of lipid- 
based exosomal studies. Currently, ExoCarta catalogs 
10 lipid-based exosomal studies. Lipid composition 
analysis have been performed with exosomes derived 



Table 1. Statistics of data cataloged in ExoCarta 

1 Number of exosome studies 

2 Number of protein entries 

3 Number of proteins 

4 Number of mRNA entries 

5 Number of mRNA molecules 

6 Number of lipid molecules 

7 Number of miRNA molecules 

8 Number of organisms from which exosomes were isolated 

9 Number of cell types from which exosomes were isolated 
10 Number of body fluids from which exosomes were isolated 



134 
11261 
4049 
2375 
1639 
58 
764 
8 
93 
12 



from dendritic cells (19), basophilic leukemia cells (20), 
mast cells (19), reticulocytes (21) and B cells (22). Lipids 
play a vital role in exosome biogenesis (23) and are char- 
acteristic of the cell origin. Interestingly, internal mem- 
branes of MVBs are shown to be enriched with lipids 
such as lyosbisphosphatidic acid (LBPA) (24) which 
plays an important role in exosome biogenesis, especially 
ILV formation (23). 

Involvement of exosomal community 

Manual curation of scientific literature is a cumbersome 
process. Most of the data present in the scientific literature 
either as inline text or in the supplementary tables 
cannot be easily queried upon and are generally lost 
over time (25). Already existing exosomal datasets in the 
scientific literature either as inline text or in the supple- 
mentary tables are annotated in a communal repository 
(ExoCarta) with the aid of manual curation. However, in 
order to keep the database updated, it is very important 
for the involvement of the exosome scientific community. 
To facilitate the active involvement of the exosomal scien- 
tific community, we have incorporated data contributions 
with undue credit provided to investigators who generated 
the data. In this regard, we have instituted a mechanism 
that allows referees to access exosomal data sets that are 
submitted by authors before publication in an authorized 
and anonymous fashion for evaluation purposes. 
The uploaded data will be highlighted as private and can 
only be accessed by the authors or their collaborators. 
Alternatively, the authors can make the data publicly 
accessible whenever they are inclined to do so. By 
undertaking such ways of data deposition, the onus on 
the curators can be reduced several folds. All contributing 
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Figure 2. Distribution of exosomes isolation procedures in the studies catalogued in ExoCarta. A histogram of the distribution of exosome isolation 
procedures in exosomal studies is displayed. As shown, 84% of the studies employed differential centrifugation, 56% used rate zonal centrifugation 
and 9% employed immunoaffinity-based methods. It has to be noted that many of the studies used a combination of these procedures. For example, 
filtration (26%), cannot be used alone to isolate exosomes, was used in conjunction with other isolation procedures. 
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investigators and the datasets will be listed in ExoCarta 
credits page (http://www.exocarta.org/credits). Currently, 
eight investigators have provided their exosomal datasets 
to the database and we expect more contributions from 
the exosomal community in the near future. 

FUTURE DIRECTIONS 

ExoCarta will be updated with more exosomal studies 
and more fields including protein-protein interactions 
and post-translational modifications will be incorporated. 
Microvesicle-based datasets will also be added to 
ExoCarta in the near future but with clear annotations 
representing the type/name of the vesicle. ExoCarta will 
also function as an active news forum in terms of high- 
lighting international scientific meetings and workshops in 
the context of exosomes or microvesicles. We intend to 
involve the exosomal scientific community in updating 
ExoCarta and foresee ExoCarta as a primary resource 
for exosome research. 

DATA USABILITY 

ExoCarta is a free to use web-based database. The 
manually curated data present in ExoCarta (Version 3.1) 
is available as tab-delimited files and is free for download 
to all the users. 
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